Mixed Observability Predictive State Representations
نویسندگان
چکیده
Learning accurate models of agent behaviours is crucial for the purpose of controlling systems where the agents’ and environment’s dynamics are unknown. This is a challenging problem, but structural assumptions can be leveraged to tackle it effectively. In particular, many systems exhibit mixed observability, when observations of some system components are essentially perfect and noiseless, while observations of other components are imperfect, aliased or noisy. In this paper we present a new model learning framework, the mixed observability predictive state representation (MO-PSR), which extends the previously known predictive state representations to the case of mixed observability systems. We present a learning algorithm that is scalable to large amounts of data and to large mixed observability domains, and show theoretical analysis of the learning consistency and computational complexity. Empirical results demonstrate that our algorithm is capable of learning accurate models, at a larger scale than with the generic predictive state representation, by leveraging the mixed observability properties.
منابع مشابه
Transfer from Multiple Linear Predictive State Representations (PSR)
In this paper we tackle the problem of transferring policy from multiple partially observable source environments to a partially observable target environment modeled as predictive state representation. This is an entirely new approach with no previous work, other than the case of transfer in fully observable domains. We develop algorithms to successfully achieve policy transfer when we have th...
متن کاملRcd Rules and Power Systems Observability
Power system state estimation is a process to find the bus voltage magnitudes and phase angles at every bus based on a given measurement set. The state estimation convergency is related to the sufficiency of the measurement set. Observability analysis actually tests this kind of problem and guarantees the state estimation accuracy. A new and useful algorithm is proposed and applied in this pape...
متن کاملObservability-Enhanced PMU Placement Considering Conventional Measurements and Contingencies
Phasor Measurement Units (PMUs) are in growing attention in recent power systems because of their paramount abilities in state estimation. PMUs are placed in existing power systems where there are already installed conventional measurements, which can be helpful if they are considered in PMU optimal placement. In this paper, a method is proposed for optimal placement of PMUs incorporating conve...
متن کاملModel Predictive Control - Ideas for the next Generation
Mixed Logical Dynamical (MLD) systems are introduced as a new system type. The MLD form is capable to model a broad class of systems arising in many applications: linear hybrid systems; sequential logical systems (finite state machines, automata); nonlinear dynamic systems, where the nonlinearity can be expressed through combinational logic; some classes of discrete event systems; constrained l...
متن کاملModelling Sparse Dynamical Systems with Compressed Predictive State Representations
Efficiently learning accurate models of dynamical systems is of central importance for developing rational agents that can succeed in a wide range of challenging domains. The difficulty of this learning problem is particularly acute in settings with large observation spaces and partial observability. We present a new algorithm, called Compressed Predictive State Representation (CPSR), for learn...
متن کامل